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METHOD AND APPARATUS FOR OPTIMIZING QUANTIZER 
VALUES IN AN IMAGE ENCODER 

BACKGROUND OF THE INVENTION 

5 The invention relates to computing quantization values used for encoding 

coeffients of a digital image or video frame and more particularly to optimizing 
the computed quantization values to reduce distortion in the digital image or video 
frame when encoding is performed with a limited number of bits. 

In many of today's image and video coders, the quality of encoded images 

10 is controlled by selecting one or more quality parameters. Block-based image and 
video coders, use a parameter known as a quantization scale or step for each block 
of pixels in the image. The quantization steps are used for scaling pixel values 
within the same step ranges to the same values. Image blocks encoded with the 
same quantization scale have approximately the same quality. The number of bits 

1 5 needed for encoding an image depends on desired image quality (quantization 
scales) and on the inherent statistics of the image. As a result, different images 
encoded with the same scales (same image quality) will occupy a different number 
ofbits. 

In many applications, the number ofbits available for encoding one or 
20 several frames is fixed in advance, and some technique is necessary to select the 
quantization scales that will produce that target number ofbits and encode the 
video frames with the highest possible quality. For example, in a digital video 
recording, a group of frames (GOP) must occupy the same number of bits for an 
efficient fast-forward/fast-rewind capability. In video telephony, the channel rate, 
25 communication delay, and size of encoder buffers determine the available number 
ofbits for one or more frames. 

Existing quantizer or buffer control methods are classified into three major 
types. A first type of quantizer control method encodes each image block several 
times with a set of quantization scales. The number ofbits produced for each case 
30 is measured and a scale for each block is smartly selected so the sum of the bits 
for all combined blocks hits the desired target bit number. The first type of 
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quantizer control techniques cannot be used for real-time encoding because of the 
high computational complexity required to encode each image block multiple 
times. 

The first type of quantizer control is described in the following 
5 publications: K. Ramchandran, A. Ortega, and M. Vetterli, "Bit Allocation for 
Dependent Quantization with Applications to Multi-Resolution and MPEG Video 
Coders,** IEEE Trans, on Image Processing, Vol. 3, N. 5, pp. 533-545, September 
1994; W. Ding and B. Liu, "Rate Control of MPEG Video Coding and Recording 
by Rate-Quantization Modeling,** IEEE Trans, on Circuits and Systems for Video 
1 0 Technology, Vol. 6, N. 1, pp. 12-19, February 1996; and LJ. Lin, A. Ortega, and 
C.C. J. Kuo, "Rate Control Using Spline-Interpolated R-D Characteristics," Proc. 
of SPIE Visual Communications and Image Processing, pp. 1 1 1-122, Orlando, FL, 
March 1996. 

A second type of quantizer control technique measures the number of bits 

1 5 spent in previously encoded image blocks and measures other parameters such as, 
buffer fullness, block activity, etc. These measurements are used to select the 
quantization scale for the current block. The second type of quantizer control is 
popular for real-time encoding because of its low computational complexity. 
However, the second type of quantizer control is inaccurate in achieving the target 

20 number of bits and must be combined with additional encoding techniques to 
avoid bit or buffer overflow and underflow. 

The second method is described in the following publications: U.S. Patent 
No. 5,038,209 entitled "Adaptive Buffer/Quantizer Control for Transform Video 
Coders", issued August 6, 1991 to H.M. Ming; U.S. Patent No. 5,159,447 entitled 

25 "Buffer Control for Variable Bit-Rate Channel", issued October 27, 1992 to B.G. 
Haskell and A.R. Reibman; and U.S. Patent No. 5,141,383 entitled "Pseudo- 
Constant Bit Rate Video Coding with Quantization Parameter Adjustment", issued 
August 31, 1993 to C.T. Cheng and A.H. Wong. 

A third type of quantizer control technique uses a model to predict the 

30 number of bits needed for encoding the image blocks. The quantizer model 
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includes the blocks' quantization scales and other parameters, such as, block 
variances. The quantization scales are determined by some mathematical 
optimization of the encoder model. The third type of quantizer control is 
computationally simple and can be used in real-time, but is highly sensitive to 
5 model errors and often produces inaccurate results. 

The third type of quantizer control technique is described in the following 
publications. E.D. Frimout, J. Biemond, and RX. Lagendik, 'Torward Rate 
Control for MPEG Recording, 5 * Proc. of SPIE Visual Communications and Image 
Processing, Cambridge, MA, pp. 184-194, November 1993; US'. Patent No. 

1 0 5,323,1 87 entitled, "Image Compression System by Setting Fixed Bit Rates", 
issued June 21, 1994 to K. Park and A. Nicoulin, "Composite Source Modeling 
for Image Compression,** PhD. Thesis N. 1444 (1995), Ecole Polytechnique 
Federale de Lausanne, Chapter 6, 1 995. 

Thus, a need remains for improving the image quality of quantized image 

15 or video frames while reducing the time and computational complexity required to 
generate optimized quantization values. 

SUMMARY OF THE INVENTION 
A quantizer controller generates quantization values using a new block- 
adaptive, Lagrangian optimization. The quantizer controller is updated and 

20 improved using information from earlier quantized blocks. The quantizer 
controller is robust to model errors and produces results as accurate as type-1 
quantizer control techniques, while having the simpler computational complexity 
of the type-2 quantizer control techniques. 

The quantizer controller identifies a target bit value equal to a total number 

25 of bits available for encoding the frame. A total amount of distortion in the frame is 
modeled according to the predicted quantization values assigned to each one of the 
blocks. The predicted quantization values are characterized according to an amount 
of energy in each block and a number of bits available for encoding each block. 
Optimum quantization values are adapted to each block by minimizing the modeled 

30 distortion in the frame subject to the constraint that the total number of bits for 
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encoding the frame is equal to the target bit value. Each block is then encoded with 
the optimized quantization value. 

The quantizer controller is adaptive to each block by reducing quantization 
values for the blocks having less energy and increasing the quantization values for 
5 the blocks having more energy. The quantization values assigned to the blocks are 
also optimized according to a number of image blocks remaining to be encoded and 
a number of bits still available for encoding the remaining image blocks. 

Different weighting factors are optionally applied to the quantization values 
that vary the accuracy of the encoded blocks. One weighting factor is applied to the 
1 0 quantization values according to the location of the block in the frame. Optimized 
quantization values are applied to blocks in each frame, frames in a group of 
multiple frames or applied generally for any region in an array of image data. 

The quantizer controller only encodes the image once to accurately 
generate the quantization values for each block. The quantization values produce 
15 a target number of bits for the encoded image or video frame. Thus, the quantizer 
controller is less computationally exhaustive than a quantizer control technique of 
similar accuracy. 

The general framework of the quantizer controller can be used in a variety 
of quantizer/rate control strategies. For example, the quantizer controller can be 
20 used to select in real-time the value of the quantization scales for the Discrete 
Cosine Transform DCT-based encoding of the frame macroblocks in the current 
video coding standards MPEG 1-2 and 4, H.261, H.263, and H.263+. A frame, 
several frames, or several macroblocks within a frame are encoded with a fixed 
number of bits. 

25 The foregoing and other objects, features and advantages of the invention 

will become more readily apparent from the following detailed description of a 
preferred embodiment of the invention, which proceeds with reference to the 
accompanying drawings. 
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BRIEF DESCRIPTION OF DRAWINGS 

FIG. 1 is a schematic diagram of multiple image frames each including 
multiple blocks assigned optimized quantization values according to the invention. 
FIG. 2 is a block diagram of an image coder according to one embodiment 
5 of the invention. 

FIG. 3 is a step diagram for generating the optimized quantization values. 
FIGS. 4 and 5 show results from applying the optimized quantization 
values to image data. 

FIG. 6 is a block diagram of the quantizer controller according to one 
1 0 embodiment of the invention. 

DETAILED DESCRIPTION 
A block-based image coder 12 is used to describe the invention. However, 
the invention can be used for controlling the quantizer of any image or video 
coder. 

1 5 Referring to FIG. 1 , in block-based image coding, images 1 5 are 

transmitted in multiple frames 26. Each frame 26 is decomposed into multiple 
image blocks 14 of the same size, typically of 16x16 pixels per block. The 
number of bits B t produced after encoding an itfa image block 14, is a function of 
the value of a quantization parameter Qj and the statistics of the block. For 

20 example, image block i = 9 contains more image information (energy) a. than 

image block i = 17. This is because the image in block i = 9 contains portions of a 
facial image along with background information. Conversely, image block i = 17 
has less image information energy a, because it contains substantially the same 
background imagery in substantially each pixel location. 

25 Referring to FIG. 2, the pixel values for each image block 14 are 

transformed into a set of coefficients, for example using a Discrete Cosine 
Transform (DCT) in block transform 16. These coefficients are quantized in block 
quantization 18 and encoded in coder 20. Bits B t of the encoded and quantized 
image blocks 14 are then transmitted over a communication channel 21 over a 

30 telephone line, microwave channel, etc. to a receiver (not shown). The receiver 
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includes a decoder that decodes the quantized bits and an inverse transform block 
that performs an Inverse Discrete Cosine Transform (EDCT). The decoded bits Bj 
are then displayed on a visual display screen to a user. 

Quantization of the transformed coefficients in quantization block 1 8 is a 
5 key procedure since it determines the quality with which the image block 14 will 
be encoded. The quantization of the ith block 14 is controlled by the parameter, 
Q,. In the H.261 and H.263 video coding standards, Qj is known as the 
quantization step for the ith block and its value corresponds to half the step size 
used for quantizing the transformed coefficients. In the MPEG-1 and MPEG-2 
1 0 standards, Qi is called the quantization scale and the jth coefficient of a block is 
quantized using a quantizer of step size Q t Wj , where Wj is the jth value of a 
quantization matrix chosen by the designer of the MPEG codec. 

Let N be the number of 16x16 image blocks in one image frame 26. The 
total number of bits B available for encoding one image frame 26 is: 

15 

B = B, + B 2 + B 3 + . . . + B N , (1) 

where the value of B depends on the quantization parameters selected, 
Qi> Q2» • • •» On. th e statistics of the blocks. The invention comprises a 
quantizer controller 22 (FIG. 1) that chooses optimum values for the Q;*s for a 
limited total number of available bits B for encoding the frames 26. The quantizer 
controller 22 is implemented in a variety of different maps including in software 
in a programmable processing unit with dedicated hardware. 

In image coding, the image blocks 14 are said to be intracoded or of class 
intra. In video coding, many of the blocks 14 in a frame 26 are very similar to 
blocks in previous frames. The values of the pixels in a block 14 are often 
predicted from previously encoded blocks and only the difference or prediction 
error is encoded. These blocks are said to be intercoded or of class inter. The 
invention can be used in frames with both intra and inter blocks. 
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Encoder Model 

The following model in equation 2 identifies the number of bits invested in 
the ith image block : 

a 2 

B^ACK^r + C), (2) 

5 The value Q, is die quantizer step size or quantization scale, A is the number of 
pixels in a block (e.g., in MPEG and H.263 A = 1 6 2 pixels), K and C are 
constants, and o, is the empirical standard deviation of the pixels in the block, 



«,-^t(W)-J5> a • (3) 

10 

The value P0 is the jth pixel in the ith block and P, is the average of the pixel 
values in the block where, 



15 



P^^tP.U). (4) 



For color images, the Pff) 's are the values of the luminance and 
chrominance components of the respective pixels. The model in equation 2 is 
derived using a rate-distortion analysis of the block's encoder. The value of K in 
equation 2 depends on the statistics of the image blocks 26 and the quantization 

20 matrix used in the encoder. For example, it can be shown that if the pixel values 
are approximately uncorrelated and Gaussian distributed, and the quantization 
matrix is flat with unitary weights (i.e., Wj =1 for all j), then K=7i /ln2. The 
constant C in equation 2 models the average number of bits per pixel used for 
encoding the coder's overhead. For example, C accounts for header and syntax 

25 information, pixel color or chrominance components, transmitted Q values, 
motion vectors, etc. sent to the receiver for decoding the image blocks. If the 
values of K and C are not known, they are estimated with an inventive technique 
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described below in the section entitled, "Updating the Parameters of the Encoder 
Model". 

Equation 5 models distortion D for the N encoded blocks, 

i N n 2 

N£? ' 12 ' K) 
5 where the a ^ s are weights chosen to incorporate the importance or cost of the 
block distortion. For example, larger a j's are chosen for image blocks having 
artifacts more visible to the human eye or for image blocks that belong to more 
important objects in the scene. If a , =a 2 = . . . =a N = 1, the distortion 
represented by equation 5 is approximately the mean squared error (MSE) 
1 0 between the original and encoded blocks. 
Optimization 

The quantizer controller 22 (FIG. 1) selects the optimal quantization 
values, Q*„ Q% • • • » Q*n» ^ at minimize the distortion model in equation 5, 
subject to the constraint that the total number of bits must be equal to B as defined 
15 in equation 1 , which can be expressed mathematically as follows: 



j N Q 2 

...,Q;=^gmin -£<*, 2 T7 (6a) 

ZBj= B 



The next objective is to find a formula for each of the Q] *s. To do this, the method 
20 of Lagrange is used to convert the constrained-minimization in equation (6a) to the 
following: 

Qi >"->Qn = 

Qi_Qn N i=i 12 Vj-i J 
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15 



where X is called the Lagrange multiplier. Next, equation (2) is used for Bj in (6b) 
to obtain: 

Qi »-"»Qn = 



Q„->Q N N/oi 12 V Qj 



Finally, by setting partial derivatives in (6c) to zero, the following formula is 
5 derived for the optimal quantizer step size for the i-th image block: 



AK ^Za.o, . (6) 



ANC)a. *„, 



Moreover, if i-1 blocks 26 have already been quantized and encoded, the optimal 
quantization parameter for the ith block is, 



q; = L AK x ^t^, (?) 

where N ; = N-i+ 1 is the number of image blocks that remain to be encoded and 
Bj is the number of bits available to encode them, 



B, = B- ZB, = B w - B M = B w - a( + C ) , 



(8) 



where B M is obtained using equation 2 with the optimized quantization value Q*_, . 
Thus, equations 6 and 7 generate optimized quantization values that minimize 
20 distortion for a limited number of available bits. As a result, using the same 

number of bits, the image in frame 26 in FIG. 1 will have less distortion than other 
quantization schemes when displayed on a display unit at the receiver end of the 
channel 21. 
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QUANTIZER CONTROL METHOD 

FIG. 3 describes the steps performed by quantizer controller 22 (FIG. 2) 
for selecting quantizer values used for encoding N image blocks 14 with B bits. 
5 Note that N could be the number of blocks in an image, part of an image, several 
images, or generally any region of an image. 
Step 1* Receive energy values and initialization. 

Pixel values for the N image blocks are obtained to the quantizer controller 
22 from the digital image (FIG. 2) in step 1 A. Initialization is performed in step 

1 0 IB by setting i = 1 (first block), B, = B (available bits), N , = N (number of 



/T 

blocks). Let S, = Ea t a 4 , where the a k 's are found using equation 3 and the 

a k *s are preset (e.g., set a , =a 2 = ... =a N = 1 to minimize mean squared error). 
In one example, the amount of energy a, is derived from the DCT coefficients of 
the pixel values generated by transform block 16. 



model in equation 7 are known or estimated in advance. For example, using linear 
regression, K, =K and C, = C. For an adaptive mode> the model parameters are 
not known, K, and C, are then set to some small non-negative values. For 
example, experiments have shown K, = 0.5 and C, = 0 to be good initial 

20 estimates. In video coding, K, and C, can be set to the values K^, and C^, , 
respectively, from the previous encoded frame. 
Step 2. Compute the optimal quantization parameter for the ith block. 

If the values of the Q-parameters are restricted to a fixed set (e.g., in 
H.263, QP=Qj/2 and takes values 1,2,3,..., 31,), Qj* is rounded to the nearest value 

25 in the set. The square root operation is then implemented using look-up tables, 



N 



15 



For a fixed mode, the values of the parameters K and C in the encoder 



where 
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Step 3. Encode the ith block with a block-based coder. 

B[ is the number of bits used to encode the ith block, compute 

5 B j+ , =S, -B{, S M =S,-a/j f . andN^N.-l. 

Step 4. Update quantizer values. 

In step 4, the parameters K^, and C m are updated in the quantizer 

controller 22. For the fixed mode, K*, = K, C HI = C. For the adaptive mode; the 
1 0 updates K*, and C m are found using a model fitting technique. One example of a 

model fitting technique is described below in the section entitled "Updating 

Parameters in the Quantizer controller 1 *. 

StepS. Generate quantizer value for next block. 

If i ~ N in decision step 5, quantization values have been derived for all image 
1 5 blocks 14 and die quantizer controller 22 stops. If all of the image blocks 14 have 

not been quantized, the quantizer controller 22 receives the coefficients for the 

next image block i =i+l in step 6, and jumps back to step 2. The quantization 

value for block i = i+1 are then derived as described above. 

Referring to FIGS. 4 and 5, the frames of video sequences encoded by 
20 quantizer controller 22 where compared to those of a Telenor H.263 offline 

method, which is the quantizer control technique adopted for MPEG-4 anchors. 
In FIG. 4, the total number of bits per video frame obtained by the 

quantization technique described in FIG. 3 are shown in solid line. The H.263 

offline encoding technique is shown in dashed line. Encoding was performed on 
25 133 frames of a well-known video sequence "Foreman". The target number of 

bits B is 1 1200 bits per frame. FIG. 5 is like FIG. 4, but for 140 frames of the 

video sequence "Mother and Daughter" with B=6400. 

The quantizer controller 22 produces a significantly more accurate and 

steady number of bits per frame. Similar results were obtained for a wide range of 
30 bit rates. In the experiments, there were little if no visible differences in the 

quality of the two encoded video sequences. The signal to noise ratio performance 

11 
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of the images processed by quantizer controller 22 was only 0.1-0.3 dB lower on 
average. Thus, even though the image is only encoded once, quantizer controller 
22 achieves the target bit rate accurately with high image quality at every frame. 
Alternative Implementations 
5 Several quantization variations are based on the base quantization 

optimization framework discussed above. If the computation of all a k *s in Step 
IB of FIG. 3 cannot be performed in advance, a good estimate for S, is used, such 
as the value of S, from the previous video frame 26. 

A low-complexity estimate of S, can be used, in order to further reduce 
1 0 computational complexity. For the low complexity estimate, equation 3 is 
replaced by equation 9, 

^4^0')^), (9) 

1 5 where abs(x) is the absolute value of x. In video coding, the mean value of pixels 
in inter blocks is usually zero and hence equation 9 may be simplified by setting 

J>=0. 

A fixed optimization selects the quantization parameters using equation 6 
instead of equation 7. To do this, in step 3 in FIG. 3, the values for B^, , S i+I , and 

20 N KI are replaced by B i+I =B, S^sS,, and N i+1 = N , respectively. 

For a variable-rate channel, if the number of bits available after encoding i 

blocks changes to B , because of a change of channel rate or other factors, set 

B j+1 = B in step 3. 

The quantization model defined in equation 2 can be generalized to 
25 equation 10, 

B^A.CK^ + C), 00) 
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where A, is the number of pixels in the jth region. The region of quantization does 
not need to be a block. Additional model parameters <|> and y can either be set 
prior to quantization or obtained using parameter estimation techniques described 
below. If the quantization model in equation 10 is used in step 2, the optimized 
quantization values Q* 's are derived using equation 11, 



V 



(11) 



N — ^ 

in step 1, S, is replaced by S l = ECA.a*) 7 * 2 ^* 2 . In step 3, S i+I is replaced 



1 0 Encoding Intra and Inter Blocks. 

If some of the blocks to be encoded are of class intra (in the same frame) 
and some inter (between different frames), performance of the quantizer controller 
22 can be improved by dividing the standard deviation of the intra blocks by a 

factor ^/p . Specifically, after computing the value for the a, *s in step 1, the 
15 factor ^/p is applied as follows: 



—j= if kth block is intra 
o\ otherwise 



The factor 0 is, 



P = 



20 The values K 7 and K P are the averages of the K*s measured for the intra and inter 
blocks, respectively. The value of P is estimated and updated during encoding. 
During experimentation it was found that using a constant P = 3 works well. 
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Frame-Based Quantizer Control. 

If the quantization step is fixed for all the blocks 14 within a frame 26, the 
same quantizer controller 22 shown in FIG. l can be used for encoding one or 
several frames. The parameters are reinterpreted so that: N = Number of frames, 
5 B = Bits available for encoding the N frames, i = Frame number in the video 
sequence, Qj = Quantization step for all the blocks in the ith frame, and A = 
Number of pixels in a frame. 

The parameters a |9 a„B!» are the weight, variance, and bits for the ith 
frame, respectively. The parameters and C t are updates of the coder model for 
1 0 that frame. 

If computational complexity is not an issue, each image block 14 can be 
encoded several times and, using a classical model fitting procedure (e.g., least- 
squares fit, linear regression, etc.), a good estimate of the K/s and Q's for the 
blocks can be found in advance. Then, in step 2, the quantization values Q* are 
1 5 determined according to, 



In Step 1, S, is replaced by S } = T,y[K^<y k oi k , and in step 3, S i+I is replaced by 



20 S„j =S t - jK^ofiii . To reduce the complexity, one can set C=C f=0 and avoid 
the updating and computation of this model parameter. In that case, observe that 
Q* in step 2 is simply, 



1 U, «. s " 



or equivalent!^ 
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Q ' = VV where A i =AK j and T. = ~^T LB i- 

Any subset of the different techniques described in "Alternative Implementations" 
can be combined and used together. 
Updating Parameters of the Encoder Model. 
5 The following is one technique for updating the parameters K^, and Q+, in 

the quantizer controller 22. This update technique is used with the adaptive mode 
described above in step 4. Other classical parameter estimation or model fitting 
techniques such as least squares, recursive least squares, Kalman prediction, etc. 
can alternatively be used. The model parameters can be updated in every block, 
1 0 frame, group of blocks, or group of frames. 

The model parameters in one embodiment of the invention, are updated or 
estimated on a block-by-block basis using the following weighted averages, 



A_2 9 i a »n* 



Ao? ' ' A Q*? ' 

15 

The values of K and C predict B[ using equation 2. Alternatively, in some codecs 
these formulas are used for measuring K, and C. : 

xr B 'pctj Q? - A A B ?~ B 'pcr,i 

K i= — ^,and C,=— — , 

where is the number of bits spent for the DCT coefficients of the i-th image 
20 block. 

The averages of the K/s and C^s are computed as follows, 



_ i-l~ 1 * ~ 1 * 

K^V^+TKi. and Q=— C M+T C r 
li ii 
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The updates are a linearly weighted average of K, , C s and their respective initial 
estimates K„ C„ 

5 

If the general model in equation 10 is used, a variety of estimators can be used to 
estimate the additional parameters <|> and y . These parameters can also be 
updated on a block-by-block basis, and the ith updates ^ and y , can be found 
using averaging techniques similar to those for Kj, Q . 

1 0 Selection of the a, Weights. 

The a s values can be chosen to incorporate the importance or weight of 
block distortions. If default values are used a , =a 2 = . . . =a N = 1, the MSB 
distortion is minimized between the original and the encoded blocks. Otherwise, 
the MSE distortion decreases in blocks with larger a j's and increases where the 
15 a j's are smaller. Two examples are described below for choosing the a { weights. 
A region, such as a rectangular window in a video telephone image, is assigned a 
larger value of a { and, in turn, smaller quantization values. The weighted region 
will be coded with better quality since a smaller quantization scale is used to 
quantize pixel values. 

20 People usually pay more attention to the central region of a picture. Thus, 

larger values of a { are assigned to the regions near the center of the picture. A 
pyramid formula is used to assign larger values of a , to blocks closer to the center 
of the frame. Specifically, let B x and B Y be the number of blocks along the 
horizontal and vertical coordinates, respectively. The weight for the i-th block is 

25 computed as follows, 

«, =a,(l-|b„ -^|£)(l-|b, «>. 
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where (a, + and as are the height and offset of the pyramid, respectively, and b x 
and by are the horizontal and vertical position of the block in the frame. For 
example, choosing a,=15 and a^l causes the a { value of the center block to be 16 
5 times that of boundary blocks. 
Block Joining 

In a codec, the values of the quantizer values used Q*„ ■ • Q* N (see step 2 
above) need to be encoded and sent to the decoder. For example, in H.263, 
quantizer values are encoded in a raster-scan order and there is a five-bit penalty 

1 0 for changing the quantizer value. At high bit rates, the bit overhead for changing 
the quantizer values is negligible and the optimization techniques described above 
are effective. However, at very-low bit rates, this overhead is significant and 
some technique is needed for constraining the number of times that the quantizer 
changes. Unfortunately, existing optimization methods that take quantization 

1 5 overhead into account are mathematically inaccurate or computationally 
expensive. 

In another aspect of the invention, a heuristic method joins blocks of 
similar standard deviation together into a set so that the quantizer value remains 
constant within the set. This technique is referred to as block joining, and reduces 
20 the changes of the quantizer at lower bit rates. Block joining is accomplished by 
choosing the values of the a, weights as follows, 



a,= 



1 otherwise 



25 where B/(AN) is the bit rate in bits per pixel for the current frame. The values of 
B, A, and N were defined earlier as the number of bits available, number of pixels 
in a block, and number of blocks, respectively. If the bit rate is above 0.5, the a, *s 
are all equal to 1 and hence have no effect. At lower bit rates, the a,- 's linearly 
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approach the respective a, *s and progressively reduce the range of the Q**s. In 
fact, ifthe bit rate is 0, then a, = <r, and all the quantizer values are equal, and 
hence all blocks are joined into one set, 

J TV n * 

(b-anc)^ * 

FIG. 6 is a detailed block diagram of the quantizer controller 22 shown in 
FIG. 2. The quantizer controller 22 in one embodiment is implemented in a 

10 general purpose programmable processor. The functional blocks in FIG. 6 
represent the primary operations performed by the processor. Initialization 
parameters in block 31 are either derived from pre-processing the current image or 
from parameters previously derived from previous frames or from prestored values 
in processor memory (not shown). Initialization parameters include N„ S„ B„ K, 

1 5 and C, (or K^, and Q*, from a previous frame). 

The image is decomposed into N image blocks 14 (FIG. 2) of A pixels in 
block 30. The energy of the pixels in each block is computed in block 32. The 
weight factors assigned to each block are computed in block 34. Hie amount of 
energy left in the image is updated in block 40 and the bits left for encoding the 

20 image are updated in block 38. Parameters for the encoder model are updated in 
block 42 and the number of blocks remaining to be encoded are tracked in block 
44. The processor in block 36 then computes the optimized quantizer step size 
according to the values derived in blocks 32, 34, 40, 38, 42 and 44. 

Having described and illustrated the principles of the invention in a 

25 preferred embodiment thereof it should be apparent that the invention can be 
modified in arrangement and detail without departing from such principles. I 
claim all modifications and variation coming within the spirit and scope of the 
following claims. 
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